Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 506 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 51.5 KiB |
| Average record size in memory | 104.3 B |
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 1 |
Warnings
| RAD is highly correlated with TAX | High correlation |
|---|---|
| TAX is highly correlated with RAD | High correlation |
| ZN has 372 (73.5%) zeros | Zeros |
Reproduction
| Analysis started | 2021-01-09 02:55:42.983406 |
|---|---|
| Analysis finished | 2021-01-09 02:56:16.706606 |
| Duration | 33.72 seconds |
| Software version | pandas-profiling v2.10.0 |
| Download configuration | config.yaml |
CRIM
Real number (ℝ≥0)
| Distinct | 504 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.613523557 |
|---|---|
| Minimum | 0.00632 |
| Maximum | 88.9762 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0.00632 |
|---|---|
| 5-th percentile | 0.02791 |
| Q1 | 0.082045 |
| median | 0.25651 |
| Q3 | 3.6770825 |
| 95-th percentile | 15.78915 |
| Maximum | 88.9762 |
| Range | 88.96988 |
| Interquartile range (IQR) | 3.5950375 |
Descriptive statistics
| Standard deviation | 8.601545105 |
|---|---|
| Coefficient of variation (CV) | 2.380376098 |
| Kurtosis | 37.13050913 |
| Mean | 3.613523557 |
| Median Absolute Deviation (MAD) | 0.22145 |
| Skewness | 5.223148798 |
| Sum | 1828.44292 |
| Variance | 73.9865782 |
| Monotonicity | Not monotonic |
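Several of the CRIM figures above are derived from others in the same tables, so they can be cross-checked with simple arithmetic. The sketch below copies the reported values by hand (the variable names are illustrative, not part of the report) and compares each derived quantity with the reported one, using a tolerance because the report rounds its output.

```python
import math

# Values copied by hand from the CRIM tables above.
n_observations = 506
minimum, maximum = 0.00632, 88.9762
q1, q3 = 0.082045, 3.6770825
mean, std = 3.613523557, 8.601545105
total = 1828.44292

# Quantities that follow directly from the values above.
value_range = maximum - minimum          # Range
iqr = q3 - q1                            # Interquartile range (IQR)
cv = std / mean                          # Coefficient of variation (CV)
variance = std ** 2                      # Variance
mean_from_sum = total / n_observations   # Mean recomputed from Sum

checks = [
    ("Range", value_range, 88.96988),
    ("IQR", iqr, 3.5950375),
    ("CV", cv, 2.380376098),
    ("Variance", variance, 73.9865782),
    ("Mean", mean_from_sum, 3.613523557),
]

for name, derived, reported in checks:
    # The report rounds its figures, so compare with a relative tolerance.
    ok = math.isclose(derived, reported, rel_tol=1e-6)
    print(f"{name}: derived={derived:.10g} reported={reported} match={ok}")
```

The same relationships hold for every numeric variable in the report, so the check can be repeated by swapping in another variable's values.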
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.01501 | 2 | 0.4% |
| 14.3337 | 2 | 0.4% |
| 0.57834 | 1 | 0.2% |
| 0.06127 | 1 | 0.2% |
| 0.03548 | 1 | 0.2% |
| 0.1403 | 1 | 0.2% |
| 0.03705 | 1 | 0.2% |
| 0.95577 | 1 | 0.2% |
| 0.11747 | 1 | 0.2% |
| 0.03537 | 1 | 0.2% |
| Other values (494) | 494 | 97.6% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.00632 | 1 | 0.2% |
| 0.00906 | 1 | 0.2% |
| 0.01096 | 1 | 0.2% |
| 0.01301 | 1 | 0.2% |
| 0.01311 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 88.9762 | 1 | 0.2% |
| 73.5341 | 1 | 0.2% |
| 67.9208 | 1 | 0.2% |
| 51.1358 | 1 | 0.2% |
| 45.7461 | 1 | 0.2% |
ZN
Real number (ℝ≥0)
| Distinct | 26 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.36363636 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros | 372 |
| Zeros (%) | 73.5% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 12.5 |
| 95-th percentile | 80 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 12.5 |
Descriptive statistics
| Standard deviation | 23.32245299 |
|---|---|
| Coefficient of variation (CV) | 2.052375864 |
| Kurtosis | 4.031510084 |
| Mean | 11.36363636 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.225666323 |
| Sum | 5750 |
| Variance | 543.9368137 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 372 | 73.5% |
| 20 | 21 | 4.2% |
| 80 | 15 | 3.0% |
| 12.5 | 10 | 2.0% |
| 25 | 10 | 2.0% |
| 22 | 10 | 2.0% |
| 40 | 7 | 1.4% |
| 30 | 6 | 1.2% |
| 45 | 6 | 1.2% |
| 90 | 5 | 1.0% |
| Other values (16) | 44 | 8.7% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 372 | 73.5% |
| 12.5 | 10 | 2.0% |
| 17.5 | 1 | 0.2% |
| 18 | 1 | 0.2% |
| 20 | 21 | 4.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 100 | 1 | 0.2% |
| 95 | 4 | 0.8% |
| 90 | 5 | 1.0% |
| 85 | 2 | 0.4% |
| 82.5 | 2 | 0.4% |
INDUS
Real number (ℝ≥0)
| Distinct | 76 |
|---|---|
| Distinct (%) | 15.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.13677866 |
|---|---|
| Minimum | 0.46 |
| Maximum | 27.74 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0.46 |
|---|---|
| 5-th percentile | 2.18 |
| Q1 | 5.19 |
| median | 9.69 |
| Q3 | 18.1 |
| 95-th percentile | 21.89 |
| Maximum | 27.74 |
| Range | 27.28 |
| Interquartile range (IQR) | 12.91 |
Descriptive statistics
| Standard deviation | 6.860352941 |
|---|---|
| Coefficient of variation (CV) | 0.6160087358 |
| Kurtosis | -1.233539601 |
| Mean | 11.13677866 |
| Median Absolute Deviation (MAD) | 6.32 |
| Skewness | 0.2950215679 |
| Sum | 5635.21 |
| Variance | 47.06444247 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 18.1 | 132 | 26.1% |
| 19.58 | 30 | 5.9% |
| 8.14 | 22 | 4.3% |
| 6.2 | 18 | 3.6% |
| 21.89 | 15 | 3.0% |
| 9.9 | 12 | 2.4% |
| 3.97 | 12 | 2.4% |
| 10.59 | 11 | 2.2% |
| 8.56 | 11 | 2.2% |
| 5.86 | 10 | 2.0% |
| Other values (66) | 233 | 46.0% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.46 | 1 | 0.2% |
| 0.74 | 1 | 0.2% |
| 1.21 | 1 | 0.2% |
| 1.22 | 1 | 0.2% |
| 1.25 | 2 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 27.74 | 5 | 1.0% |
| 25.65 | 7 | 1.4% |
| 21.89 | 15 | 3.0% |
| 19.58 | 30 | 5.9% |
| 18.1 | 132 | 26.1% |
CHAS
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 KiB |
| 0.0 | 471 |
|---|---|
| 1.0 | 35 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1518 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 471 | 93.1% |
| 1.0 | 35 | 6.9% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 977 | 64.4% |
| . | 506 | 33.3% |
| 1 | 35 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 1012 | 66.7% |
| Other Punctuation | 506 | 33.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 977 | 96.5% |
| 1 | 35 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| . | 506 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 1518 | 100.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 977 | 64.4% |
| . | 506 | 33.3% |
| 1 | 35 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 1518 | 100.0% |
Most frequent character per block
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 977 | 64.4% |
| . | 506 | 33.3% |
| 1 | 35 | 2.3% |
NOX
Real number (ℝ≥0)
| Distinct | 81 |
|---|---|
| Distinct (%) | 16.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5546950593 |
|---|---|
| Minimum | 0.385 |
| Maximum | 0.871 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0.385 |
|---|---|
| 5-th percentile | 0.40925 |
| Q1 | 0.449 |
| median | 0.538 |
| Q3 | 0.624 |
| 95-th percentile | 0.74 |
| Maximum | 0.871 |
| Range | 0.486 |
| Interquartile range (IQR) | 0.175 |
Descriptive statistics
| Standard deviation | 0.1158776757 |
|---|---|
| Coefficient of variation (CV) | 0.2089033853 |
| Kurtosis | -0.06466713337 |
| Mean | 0.5546950593 |
| Median Absolute Deviation (MAD) | 0.0875 |
| Skewness | 0.7293079225 |
| Sum | 280.6757 |
| Variance | 0.01342763572 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.538 | 23 | 4.5% |
| 0.713 | 18 | 3.6% |
| 0.437 | 17 | 3.4% |
| 0.871 | 16 | 3.2% |
| 0.624 | 15 | 3.0% |
| 0.489 | 15 | 3.0% |
| 0.605 | 14 | 2.8% |
| 0.693 | 14 | 2.8% |
| 0.74 | 13 | 2.6% |
| 0.544 | 12 | 2.4% |
| Other values (71) | 349 | 69.0% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.385 | 1 | 0.2% |
| 0.389 | 1 | 0.2% |
| 0.392 | 2 | 0.4% |
| 0.394 | 1 | 0.2% |
| 0.398 | 2 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.871 | 16 | 3.2% |
| 0.77 | 8 | 1.6% |
| 0.74 | 13 | 2.6% |
| 0.718 | 6 | 1.2% |
| 0.713 | 18 | 3.6% |
RM
Real number (ℝ≥0)
| Distinct | 446 |
|---|---|
| Distinct (%) | 88.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.284634387 |
|---|---|
| Minimum | 3.561 |
| Maximum | 8.78 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 3.561 |
|---|---|
| 5-th percentile | 5.314 |
| Q1 | 5.8855 |
| median | 6.2085 |
| Q3 | 6.6235 |
| 95-th percentile | 7.5875 |
| Maximum | 8.78 |
| Range | 5.219 |
| Interquartile range (IQR) | 0.738 |
Descriptive statistics
| Standard deviation | 0.7026171434 |
|---|---|
| Coefficient of variation (CV) | 0.1117992074 |
| Kurtosis | 1.891500366 |
| Mean | 6.284634387 |
| Median Absolute Deviation (MAD) | 0.3455 |
| Skewness | 0.4036121333 |
| Sum | 3180.025 |
| Variance | 0.4936708502 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 6.167 | 3 | 0.6% |
| 6.405 | 3 | 0.6% |
| 5.713 | 3 | 0.6% |
| 6.417 | 3 | 0.6% |
| 6.127 | 3 | 0.6% |
| 6.229 | 3 | 0.6% |
| 5.39 | 2 | 0.4% |
| 5.304 | 2 | 0.4% |
| 6.968 | 2 | 0.4% |
| 6.009 | 2 | 0.4% |
| Other values (436) | 480 | 94.9% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 3.561 | 1 | 0.2% |
| 3.863 | 1 | 0.2% |
| 4.138 | 2 | 0.4% |
| 4.368 | 1 | 0.2% |
| 4.519 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 8.78 | 1 | 0.2% |
| 8.725 | 1 | 0.2% |
| 8.704 | 1 | 0.2% |
| 8.398 | 1 | 0.2% |
| 8.375 | 1 | 0.2% |
AGE
Real number (ℝ≥0)
| Distinct | 356 |
|---|---|
| Distinct (%) | 70.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 68.57490119 |
|---|---|
| Minimum | 2.9 |
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 2.9 |
|---|---|
| 5-th percentile | 17.725 |
| Q1 | 45.025 |
| median | 77.5 |
| Q3 | 94.075 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 97.1 |
| Interquartile range (IQR) | 49.05 |
Descriptive statistics
| Standard deviation | 28.14886141 |
|---|---|
| Coefficient of variation (CV) | 0.410483441 |
| Kurtosis | -0.9677155942 |
| Mean | 68.57490119 |
| Median Absolute Deviation (MAD) | 19.55 |
| Skewness | -0.5989626399 |
| Sum | 34698.9 |
| Variance | 792.3583985 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 100 | 43 | 8.5% |
| 97.9 | 4 | 0.8% |
| 96 | 4 | 0.8% |
| 95.4 | 4 | 0.8% |
| 98.2 | 4 | 0.8% |
| 87.9 | 4 | 0.8% |
| 98.8 | 4 | 0.8% |
| 97.4 | 3 | 0.6% |
| 94.1 | 3 | 0.6% |
| 96.2 | 3 | 0.6% |
| Other values (346) | 430 | 85.0% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2.9 | 1 | 0.2% |
| 6 | 1 | 0.2% |
| 6.2 | 1 | 0.2% |
| 6.5 | 1 | 0.2% |
| 6.6 | 2 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 100 | 43 | 8.5% |
| 99.3 | 1 | 0.2% |
| 99.1 | 1 | 0.2% |
| 98.9 | 3 | 0.6% |
| 98.8 | 4 | 0.8% |
DIS
Real number (ℝ≥0)
| Distinct | 412 |
|---|---|
| Distinct (%) | 81.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.795042688 |
|---|---|
| Minimum | 1.1296 |
| Maximum | 12.1265 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 1.1296 |
|---|---|
| 5-th percentile | 1.461975 |
| Q1 | 2.100175 |
| median | 3.20745 |
| Q3 | 5.188425 |
| 95-th percentile | 7.8278 |
| Maximum | 12.1265 |
| Range | 10.9969 |
| Interquartile range (IQR) | 3.08825 |
Descriptive statistics
| Standard deviation | 2.105710127 |
|---|---|
| Coefficient of variation (CV) | 0.5548580872 |
| Kurtosis | 0.4879411222 |
| Mean | 3.795042688 |
| Median Absolute Deviation (MAD) | 1.29115 |
| Skewness | 1.011780579 |
| Sum | 1920.2916 |
| Variance | 4.434015137 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 3.4952 | 5 | 1.0% |
| 5.7209 | 4 | 0.8% |
| 5.2873 | 4 | 0.8% |
| 6.8147 | 4 | 0.8% |
| 5.4007 | 4 | 0.8% |
| 7.8278 | 3 | 0.6% |
| 3.9454 | 3 | 0.6% |
| 7.309 | 3 | 0.6% |
| 5.4917 | 3 | 0.6% |
| 6.4798 | 3 | 0.6% |
| Other values (402) | 470 | 92.9% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.1296 | 1 | 0.2% |
| 1.137 | 1 | 0.2% |
| 1.1691 | 1 | 0.2% |
| 1.1742 | 1 | 0.2% |
| 1.1781 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 12.1265 | 1 | 0.2% |
| 10.7103 | 2 | 0.4% |
| 10.5857 | 2 | 0.4% |
| 9.2229 | 1 | 0.2% |
| 9.2203 | 2 | 0.4% |
RAD
Real number (ℝ≥0)
| Distinct | 9 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.549407115 |
|---|---|
| Minimum | 1 |
| Maximum | 24 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 5 |
| Q3 | 24 |
| 95-th percentile | 24 |
| Maximum | 24 |
| Range | 23 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 8.707259384 |
|---|---|
| Coefficient of variation (CV) | 0.9118115166 |
| Kurtosis | -0.8672319936 |
| Mean | 9.549407115 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.004814648 |
| Sum | 4832 |
| Variance | 75.81636598 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 24 | 132 | 26.1% |
| 5 | 115 | 22.7% |
| 4 | 110 | 21.7% |
| 3 | 38 | 7.5% |
| 6 | 26 | 5.1% |
| 2 | 24 | 4.7% |
| 8 | 24 | 4.7% |
| 1 | 20 | 4.0% |
| 7 | 17 | 3.4% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 20 | 4.0% |
| 2 | 24 | 4.7% |
| 3 | 38 | 7.5% |
| 4 | 110 | 21.7% |
| 5 | 115 | 22.7% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 24 | 132 | 26.1% |
| 8 | 24 | 4.7% |
| 7 | 17 | 3.4% |
| 6 | 26 | 5.1% |
| 5 | 115 | 22.7% |
TAX
Real number (ℝ≥0)
| Distinct | 66 |
|---|---|
| Distinct (%) | 13.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 408.2371542 |
|---|---|
| Minimum | 187 |
| Maximum | 711 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 187 |
|---|---|
| 5-th percentile | 222 |
| Q1 | 279 |
| median | 330 |
| Q3 | 666 |
| 95-th percentile | 666 |
| Maximum | 711 |
| Range | 524 |
| Interquartile range (IQR) | 387 |
Descriptive statistics
| Standard deviation | 168.5371161 |
|---|---|
| Coefficient of variation (CV) | 0.4128411987 |
| Kurtosis | -1.142407992 |
| Mean | 408.2371542 |
| Median Absolute Deviation (MAD) | 73 |
| Skewness | 0.6699559418 |
| Sum | 206568 |
| Variance | 28404.75949 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 666 | 132 | 26.1% |
| 307 | 40 | 7.9% |
| 403 | 30 | 5.9% |
| 437 | 15 | 3.0% |
| 304 | 14 | 2.8% |
| 264 | 12 | 2.4% |
| 398 | 12 | 2.4% |
| 277 | 11 | 2.2% |
| 384 | 11 | 2.2% |
| 330 | 10 | 2.0% |
| Other values (56) | 219 | 43.3% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 187 | 1 | 0.2% |
| 188 | 7 | 1.4% |
| 193 | 8 | 1.6% |
| 198 | 1 | 0.2% |
| 216 | 5 | 1.0% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 711 | 5 | 1.0% |
| 666 | 132 | 26.1% |
| 469 | 1 | 0.2% |
| 437 | 15 | 3.0% |
| 432 | 9 | 1.8% |
PTRATIO
Real number (ℝ≥0)
| Distinct | 46 |
|---|---|
| Distinct (%) | 9.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.4555336 |
|---|---|
| Minimum | 12.6 |
| Maximum | 22 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 12.6 |
|---|---|
| 5-th percentile | 14.7 |
| Q1 | 17.4 |
| median | 19.05 |
| Q3 | 20.2 |
| 95-th percentile | 21 |
| Maximum | 22 |
| Range | 9.4 |
| Interquartile range (IQR) | 2.8 |
Descriptive statistics
| Standard deviation | 2.164945524 |
|---|---|
| Coefficient of variation (CV) | 0.1173060379 |
| Kurtosis | -0.2850913833 |
| Mean | 18.4555336 |
| Median Absolute Deviation (MAD) | 1.15 |
| Skewness | -0.8023249269 |
| Sum | 9338.5 |
| Variance | 4.686989121 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 20.2 | 140 | 27.7% |
| 14.7 | 34 | 6.7% |
| 21 | 27 | 5.3% |
| 17.8 | 23 | 4.5% |
| 19.2 | 19 | 3.8% |
| 17.4 | 18 | 3.6% |
| 18.6 | 17 | 3.4% |
| 19.1 | 17 | 3.4% |
| 16.6 | 16 | 3.2% |
| 18.4 | 16 | 3.2% |
| Other values (36) | 179 | 35.4% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 12.6 | 3 | 0.6% |
| 13 | 12 | 2.4% |
| 13.6 | 1 | 0.2% |
| 14.4 | 1 | 0.2% |
| 14.7 | 34 | 6.7% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 22 | 2 | 0.4% |
| 21.2 | 15 | 3.0% |
| 21.1 | 1 | 0.2% |
| 21 | 27 | 5.3% |
| 20.9 | 11 | 2.2% |
B
Real number (ℝ≥0)
| Distinct | 357 |
|---|---|
| Distinct (%) | 70.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 356.6740316 |
|---|---|
| Minimum | 0.32 |
| Maximum | 396.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0.32 |
|---|---|
| 5-th percentile | 84.59 |
| Q1 | 375.3775 |
| median | 391.44 |
| Q3 | 396.225 |
| 95-th percentile | 396.9 |
| Maximum | 396.9 |
| Range | 396.58 |
| Interquartile range (IQR) | 20.8475 |
Descriptive statistics
| Standard deviation | 91.29486438 |
|---|---|
| Coefficient of variation (CV) | 0.255961624 |
| Kurtosis | 7.226817549 |
| Mean | 356.6740316 |
| Median Absolute Deviation (MAD) | 5.46 |
| Skewness | -2.890373712 |
| Sum | 180477.06 |
| Variance | 8334.752263 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 396.9 | 121 | 23.9% |
| 395.24 | 3 | 0.6% |
| 393.74 | 3 | 0.6% |
| 394.12 | 2 | 0.4% |
| 395.56 | 2 | 0.4% |
| 390.94 | 2 | 0.4% |
| 388.45 | 2 | 0.4% |
| 393.23 | 2 | 0.4% |
| 396.21 | 2 | 0.4% |
| 393.37 | 2 | 0.4% |
| Other values (347) | 365 | 72.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.32 | 1 | 0.2% |
| 2.52 | 1 | 0.2% |
| 2.6 | 1 | 0.2% |
| 3.5 | 1 | 0.2% |
| 3.65 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 396.9 | 121 | 23.9% |
| 396.42 | 1 | 0.2% |
| 396.33 | 1 | 0.2% |
| 396.3 | 1 | 0.2% |
| 396.28 | 1 | 0.2% |
LSTAT
Real number (ℝ≥0)
| Distinct | 455 |
|---|---|
| Distinct (%) | 89.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.65306324 |
|---|---|
| Minimum | 1.73 |
| Maximum | 37.97 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 1.73 |
|---|---|
| 5-th percentile | 3.7075 |
| Q1 | 6.95 |
| median | 11.36 |
| Q3 | 16.955 |
| 95-th percentile | 26.8075 |
| Maximum | 37.97 |
| Range | 36.24 |
| Interquartile range (IQR) | 10.005 |
Descriptive statistics
| Standard deviation | 7.141061511 |
|---|---|
| Coefficient of variation (CV) | 0.5643741263 |
| Kurtosis | 0.4932395174 |
| Mean | 12.65306324 |
| Median Absolute Deviation (MAD) | 4.795 |
| Skewness | 0.9064600936 |
| Sum | 6402.45 |
| Variance | 50.99475951 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 8.05 | 3 | 0.6% |
| 6.36 | 3 | 0.6% |
| 18.13 | 3 | 0.6% |
| 14.1 | 3 | 0.6% |
| 7.79 | 3 | 0.6% |
| 18.46 | 2 | 0.4% |
| 9.97 | 2 | 0.4% |
| 5.33 | 2 | 0.4% |
| 10.45 | 2 | 0.4% |
| 6.72 | 2 | 0.4% |
| Other values (445) | 481 | 95.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.73 | 1 | 0.2% |
| 1.92 | 1 | 0.2% |
| 1.98 | 1 | 0.2% |
| 2.47 | 1 | 0.2% |
| 2.87 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 37.97 | 1 | 0.2% |
| 36.98 | 1 | 0.2% |
| 34.77 | 1 | 0.2% |
| 34.41 | 1 | 0.2% |
| 34.37 | 1 | 0.2% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
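The calculation described above can be sketched in a few lines of plain Python. This is an illustrative sketch, not the code the report itself runs: the function name `pearson_r` and the sample data are assumptions made for the example.

```python
import math


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n (or 1/(n-1)) factors of covariance and variance cancel in the
    # ratio, so plain sums of products are enough.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / math.sqrt(var_x * var_y)


# Made-up sample data for illustration only.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 8.1, 9.8]

r_original = pearson_r(xs, ys)
# Shifting and rescaling one variable (with a positive scale) leaves r unchanged.
r_rescaled = pearson_r([10 * x + 5 for x in xs], ys)
print(r_original, r_rescaled)
```

A perfectly linear relationship such as y = 2x + 3 gives r = 1 regardless of the slope, which is the invariance property mentioned above.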
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
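As a minimal sketch of that definition, the example below ranks both variables and then applies the Pearson formula to the ranks. The helper names (`average_ranks`, `pearson_r`, `spearman_rho`) and the handling of ties by average rank are assumptions for illustration, not taken from the report.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def spearman_rho(xs, ys):
    """Pearson correlation of the rank variables of xs and ys."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


# A monotonic but nonlinear relationship (y = x cubed), made up for illustration.
xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]
print(spearman_rho(xs, ys), pearson_r(xs, ys))
```

Because y increases whenever x does, the ranks agree exactly and ρ reaches its maximum, while Pearson's r on the raw values stays below 1.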
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
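The pair-counting definition above corresponds to the variant usually called tau-a. The sketch below implements exactly that definition; the function name `kendall_tau_a` is hypothetical, and tied pairs are counted as neither concordant nor discordant, which is an assumption of this example. Library implementations often use a tie-corrected variant instead (SciPy's `scipy.stats.kendalltau` computes tau-b by default), so results can differ when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
        # direction == 0 means a tie in xs or ys; counted as neither here
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Three observations: two concordant pairs and one discordant pair.
tau = kendall_tau_a([1, 2, 3], [1, 3, 2])
print(tau)
```

With three observations there are three pairs, so two concordant pairs and one discordant pair give τ = (2 - 1) / 3.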
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available for Phik.
First rows
| | CRIM | ZN | INDUS | CHAS | NOX | RM | AGE | DIS | RAD | TAX | PTRATIO | B | LSTAT |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.00632 | 18.0 | 2.31 | 0.0 | 0.538 | 6.575 | 65.2 | 4.0900 | 1.0 | 296.0 | 15.3 | 396.90 | 4.98 |
| 1 | 0.02731 | 0.0 | 7.07 | 0.0 | 0.469 | 6.421 | 78.9 | 4.9671 | 2.0 | 242.0 | 17.8 | 396.90 | 9.14 |
| 2 | 0.02729 | 0.0 | 7.07 | 0.0 | 0.469 | 7.185 | 61.1 | 4.9671 | 2.0 | 242.0 | 17.8 | 392.83 | 4.03 |
| 3 | 0.03237 | 0.0 | 2.18 | 0.0 | 0.458 | 6.998 | 45.8 | 6.0622 | 3.0 | 222.0 | 18.7 | 394.63 | 2.94 |
| 4 | 0.06905 | 0.0 | 2.18 | 0.0 | 0.458 | 7.147 | 54.2 | 6.0622 | 3.0 | 222.0 | 18.7 | 396.90 | 5.33 |
| 5 | 0.02985 | 0.0 | 2.18 | 0.0 | 0.458 | 6.430 | 58.7 | 6.0622 | 3.0 | 222.0 | 18.7 | 394.12 | 5.21 |
| 6 | 0.08829 | 12.5 | 7.87 | 0.0 | 0.524 | 6.012 | 66.6 | 5.5605 | 5.0 | 311.0 | 15.2 | 395.60 | 12.43 |
| 7 | 0.14455 | 12.5 | 7.87 | 0.0 | 0.524 | 6.172 | 96.1 | 5.9505 | 5.0 | 311.0 | 15.2 | 396.90 | 19.15 |
| 8 | 0.21124 | 12.5 | 7.87 | 0.0 | 0.524 | 5.631 | 100.0 | 6.0821 | 5.0 | 311.0 | 15.2 | 386.63 | 29.93 |
| 9 | 0.17004 | 12.5 | 7.87 | 0.0 | 0.524 | 6.004 | 85.9 | 6.5921 | 5.0 | 311.0 | 15.2 | 386.71 | 17.10 |
Last rows
| | CRIM | ZN | INDUS | CHAS | NOX | RM | AGE | DIS | RAD | TAX | PTRATIO | B | LSTAT |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 496 | 0.28960 | 0.0 | 9.69 | 0.0 | 0.585 | 5.390 | 72.9 | 2.7986 | 6.0 | 391.0 | 19.2 | 396.90 | 21.14 |
| 497 | 0.26838 | 0.0 | 9.69 | 0.0 | 0.585 | 5.794 | 70.6 | 2.8927 | 6.0 | 391.0 | 19.2 | 396.90 | 14.10 |
| 498 | 0.23912 | 0.0 | 9.69 | 0.0 | 0.585 | 6.019 | 65.3 | 2.4091 | 6.0 | 391.0 | 19.2 | 396.90 | 12.92 |
| 499 | 0.17783 | 0.0 | 9.69 | 0.0 | 0.585 | 5.569 | 73.5 | 2.3999 | 6.0 | 391.0 | 19.2 | 395.77 | 15.10 |
| 500 | 0.22438 | 0.0 | 9.69 | 0.0 | 0.585 | 6.027 | 79.7 | 2.4982 | 6.0 | 391.0 | 19.2 | 396.90 | 14.33 |
| 501 | 0.06263 | 0.0 | 11.93 | 0.0 | 0.573 | 6.593 | 69.1 | 2.4786 | 1.0 | 273.0 | 21.0 | 391.99 | 9.67 |
| 502 | 0.04527 | 0.0 | 11.93 | 0.0 | 0.573 | 6.120 | 76.7 | 2.2875 | 1.0 | 273.0 | 21.0 | 396.90 | 9.08 |
| 503 | 0.06076 | 0.0 | 11.93 | 0.0 | 0.573 | 6.976 | 91.0 | 2.1675 | 1.0 | 273.0 | 21.0 | 396.90 | 5.64 |
| 504 | 0.10959 | 0.0 | 11.93 | 0.0 | 0.573 | 6.794 | 89.3 | 2.3889 | 1.0 | 273.0 | 21.0 | 393.45 | 6.48 |
| 505 | 0.04741 | 0.0 | 11.93 | 0.0 | 0.573 | 6.030 | 80.8 | 2.5050 | 1.0 | 273.0 | 21.0 | 396.90 | 7.88 |